The Effects of Sampling Bias and Model Complexity on the Predictive Performance of MaxEnt Species Distribution Models

Authors

  • Mindy M. Syfert
  • Matthew J. Smith
  • David A. Coomes
Abstract

Species distribution models (SDMs) trained on presence-only data are frequently used in ecological research and conservation planning. However, users of SDM software face a variety of options, and it is not always obvious how selecting one option over another will affect model performance. Working with MaxEnt software and tree fern presence data from New Zealand, we assessed whether (a) correcting for geographical sampling bias and (b) using complex environmental response curves have strong effects on goodness of fit. SDMs were trained on tree fern data obtained from an online biodiversity data portal, drawn from two sources that differed in size and geographical sampling bias: a small, widely distributed set of herbarium specimens and a large, spatially clustered set of ecological survey records. We attempted to correct for geographical sampling bias by incorporating sampling bias grids in the SDMs, created from all georeferenced vascular plants in the datasets, and explored model complexity by fitting a wide variety of environmental response curves (known as "feature types" in MaxEnt). In each case, goodness of fit was assessed by comparing predicted range maps with tree fern presences and absences from an independent national dataset. We found that correcting for geographical sampling bias led to major improvements in goodness of fit but did not entirely resolve the problem: predictions made with the clustered ecological data were inferior to those made with the herbarium dataset, even after sampling bias correction. We also found that the choice of feature type had negligible effects on predictive performance, indicating that simple feature types may be sufficient once sampling bias is accounted for. Our study emphasizes the importance of reducing geographical sampling bias, where possible, in datasets used to train SDMs, and the effectiveness and necessity of sampling bias correction within MaxEnt.
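The abstract does not say how the sampling bias grids were built. As a rough illustration only, one common way to approximate relative sampling effort is to count all georeferenced records falling in each grid cell and rescale the counts. The function name, the extent, the cell size, and the small positive floor below are assumptions made for this sketch, not details taken from the study or from MaxEnt itself.

```python
import numpy as np

def sampling_bias_grid(lons, lats, extent, cell_size, floor=1e-6):
    """Hypothetical helper: relative sampling effort per grid cell.

    extent is (lon_min, lon_max, lat_min, lat_max) in degrees.
    Row 0 of the result is the southernmost band of cells.
    """
    lon_min, lon_max, lat_min, lat_max = extent
    n_cols = int(np.ceil((lon_max - lon_min) / cell_size))
    n_rows = int(np.ceil((lat_max - lat_min) / cell_size))

    # Count records per cell; latitudes index rows, longitudes index columns.
    counts, _, _ = np.histogram2d(
        lats, lons,
        bins=[n_rows, n_cols],
        range=[[lat_min, lat_min + n_rows * cell_size],
               [lon_min, lon_min + n_cols * cell_size]],
    )

    # Rescale so the most heavily sampled cell has weight 1.
    peak = counts.max()
    grid = counts / peak if peak > 0 else counts

    # Assumption: keep every cell strictly positive with a small floor.
    return np.maximum(grid, floor)

# Three made-up records inside a rough New Zealand bounding box.
grid = sampling_bias_grid(
    lons=[170.5, 170.6, 175.2],
    lats=[-45.5, -45.4, -38.1],
    extent=(166.0, 179.0, -48.0, -34.0),
    cell_size=1.0,
)
```

In this toy call, the cell holding two records receives weight 1 and the cell holding one record receives weight 0.5. A real workflow would also need to match the resolution and extent of the environmental layers and export the grid in whatever raster format the modelling software expects.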


Similar resources

Determining the optimal presence threshold in predictive models of plant species distribution (case study: rangelands of the Nir region, Yazd province)

The current study addresses the determination of optimal occurrence thresholds for predictive models of plant species distribution in the Nir rangelands of Yazd province. Accordingly, after homogeneous units were determined using a digital elevation model and geology maps at a scale of 1:25000, vegetation sampling was carried out using a random systematic method via plots established across 3-5 transe...


Modeling of Artemisia sieberi Besser Habitat Distribution Using Maximum Entropy Method in Desert Rangelands

Predictive modeling of habitat distribution of range plant species and identification of their potential habitats play important roles in the restoration of disturbed rangelands. This study aimed to predict the geographical distribution of Artemisia sieberi and find the influential variables in the distribution of A. sieberi in the desert rangelands of central Iran. Maps of environmental variab...


Prediction of potential habitat distribution of Artemisia sieberi Besser using data-driven methods in Poshtkouh rangelands of Yazd province

The present study aimed to model the potential habitat distribution of A. sieberi and its ecological requirements using a generalized additive model (GAM) and a classification and regression tree (CART) in the Poshtkouh rangelands of Yazd province. For this purpose, pure habitats of the species were delineated and the species presence data were recorded by the systematic-random sampling method. Us...


MaxEnt versus MaxLike: empirical comparisons with ant species distributions

MaxEnt is one of the most widely used tools in ecology, biogeography, and evolution for modeling and mapping species distributions using presence-only occurrence records and associated environmental covariates. Despite its popularity, the exponential model implemented by MaxEnt does not directly estimate occurrence probability, the natural quantity of interest when modeling species distribution...


Comparison of the predictive performance of two species distribution models GAM and GBM for Thymus kotschyanus in Middle Taleghan Rangelands

In this study, the prediction of Thymus kotschyanus habitat distribution was investigated using two methods, the generalized additive model (GAM) and boosted regression trees (BRT), in the central part of the Taleghan rangelands. Data on vegetation and habitat factors such as topography, climate, geology and soil were collected. For data preparation, samples were taken from the field to record ...



Journal title:

Volume 8, Issue

Pages -

Publication date: 2013